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GRADIENT FLOWS OF PENALTY FUNCTIONS IN THE SPACE OF 

SMOOTH EMBEDDINGS 


DARA GOLD 

Abstract. Motivated by manifold learning techniques, we give an explicit lower bound for 
how far a smoothly embedded compact submanifold in can move in a normal direction 
and remain an embedding. In addition, given a penalty function P : Emb(M, K^) —>■ R on 
the space of embeddings, we give a condition which guarantees that the gradient VP of the 
penalty function is normal to at every point. 


1. Introduction 

In this paper we give some theoretical results and explicit estimates for gradient flows in 
inhnite dimensions, motivated by issues in manifold learning. These flows live in the space 
of embeddings Emb(M, M^) of a hxed fc-manifold M in We give a condition under 
which the gradient vector held for a functional on Emb(M, M'^) is pointwise normal to 4>{M) 
(where 0 G Emb(M,M-^)) (Theorem 1). There is also a lower bound in Theorem 3 for the 
existence of a linear how in a normal direction, which is the type of how used in numerical 
simulations. 

A fundamental problem in machine learning is to best approximate a hxed hnite set of 
data points in by a fc-dimensional embedded manifold. (See e.g., [12] and its references.) 
One standard approach to dehning an ideal approximation is to impose a penalty function 
on Emb(M, M^), where the penalty function measures the total distance from the manifold 
to the data points, the total intrinsic or extrinsic curvature of the embedding, the volume of 
the embedding, or some linear combination of these terms. (See [T] for the case of htting data 
points by a graph 0 : —)• M.) A minimal embedding for a given penalty function will 

in general not pass through all data points but will hopefully be more robust with respect 
to added points. 

In this setup, we have a penalty function P : Emb(M, R^) —)■ R on the space of smooth 
embeddings of a closed /c-manifold M into R^. From a Morse theory perspective, it is natural 
to study the negative gradient how of an initial embedding 00 , with the expectation that for 
generic (po, the long time how will approach a local minimum of P. 

This natural theoretical setup has many technical difficulties. One must choose the topol¬ 
ogy on Emb(M, R^), ideally the Frechet topology on C°° embeddings, and check that the 
penalty function is diherentiable enough to have a gradient. Moreover, the penalty function 
should contain a distance penalty term, and there are issues of continuity of the gradient 
vector held at points in 0o(Af) which are equidistant from two or more data points. Most 
importantly, it seems very difficult to show with existing nonlinear PDE techniques that 
the short time gradient how exists for even simple penalty functions. In addition, even if 
short time, hnite time and long time existence are all shown, it is not clear that a limiting 
point of a gradient line is in the original function space. In our setup, the crucial issue is 
that Emb(M, R'^) is an open subset of Maps(M, R-^) in any reasonable topology, so that it 
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would actually be surprising if the long time flow stays in the space of embeddings. Finally, 
because these penalty functions are not convex in general, any limit point need not be a 
global minimum for the penalty function. 

Despite these problems, the simulations in pQ using gradient flow seem robust and better 
than many other existing manifold learning methods. Therefore, it seems worthwhile to 
prove some results that address the theoretical difficulties. 

There is of course a large body of work on gradient flow techniques in infinite dimensions 
going back to Morse’s original papers. In applied math, there is seminal work of Osher 
and Sethian [10], who introduced the Level Set Method, by which a surface is treated as 
the level set of a function. This approach, very familiar in hnite dimensional Morse theory, 
avoids typical problems that arise with cusps and discontinuities in a flow whose speed is 
curvature dependent. In pure math, gradient flows are used to study harmonic map and mean 
curvature flow/curve shrinking problems; in these cases, the penalty function is an energy or 
volume functional. As some examples, Rupflin and Topping im study minimal immersions 
via gradient flow of the harmonic energy map paired with a flow of the Riemannian metric 
on the domain surface. For mean curvature flow, Hamilton [3] and Gerhardt [3] prove 
that convex, compact surfaces in Euclidean space along with curves in a plane contract 
smoothly to a point under mean curvature flow. Xiao [T3| gives a short time estimate for 
mean curvature flow for immersed star-shaped hypersurfaces in Euclidean space. Huisken 
and Sinestrari |6] consider compact hypersurfaces with positive mean curvature to study 
singularities than can arise during the flow. Using rescaling techniques now familiar in Ricci 
flow literature, they introduce a series of rescaled flows that approach a smooth flow. Finally, 
there is a huge body of work on Floer theory, where the energy functional’s critical points 
are pseudoholomorphic curves. 

There is some literature which blends theoretical and applied techniques. Although the 
gradient of a functional is typically computed using an inner product on the tangent space 
of the domain space, Mayer |8] uses a discretized approximation to the gradient flow, and in 
particular replaces the time derivative in the penalty flow equation with a hnite difference 
term. This leads to a short time existence result that seems so problematic in the smooth 
setting. In [2], there is a detailed discussion of the importance of choosing the right Sobolev 
space for applications to shape analysis. 

In this paper we present hrst steps towards the existence of gradient how in the space 
of embeddings. In §§1,2, we assume that M is closed. In §1, we prove that the gradient 
vector held is normal at each point in (plM) if and only if P is invariant under 

diheomorphisms of 4>{M). In §2, we consider a hxed normal gradient vector held Z^p along 
4>{M) and give an explicit estimate for how long the how of embeddings (pt = (p + tZ remains 
in the space of embeddings. 
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2. A Condition for Normal Gradient Vector Fields 

In this section, we prove an inhnite dimensional analogue of the standard hnite dimensional 
result that gradient vectors are perpendicular to level surfaces. 
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In the following theorem we use the gradient of the penalty function VP, which is dehned 
with respect to the inner product on M.^). For X G M^), the gradient 

is characterized by 

dP{X) = (VP, X)= [ VP ■ Xdvol, 

where the volume form is induced from and we are using the Euclidean dot product. 
VP’s being pointwise normal to 0(M) means that XP^(rn) ■ X^^rn) = 0 for all (j){m) G 0(M). 

Theorem 1. For a penalty function P : C°°(M, M^) —)■ M, the gradient VP will he normal 
to 0(M) for each m & M if and only if P is invariant under diffeomorphisms a : 0(M) —)■ 
(t>{M), that are in the path component of the identity in Diff((j){M)), i.e. P{a{(f){M))) = 
P(0(M)). 

Proof. («^) Assume P{a{(j){M))) = P{(j){M)) where a is a diffeomorphism from 4>{M) onto 
4>{M) that is generated from the flow of a time independent vector held on 4>{M). We know 
that XPtj)Q J-Lj for all X^^ that are tangent to the level set containing 00 £ M^). 

Claim: All vector helds G r(T0o(Af)) he tangent to the level set of 0o ^ C°°(M, M^). 

Proof of Claim: For 10,^ G F(T0o(d^)) we have an associated how along 4>{M) given 
by Oyt : and aY^t4>ijn) = Y^^rn)- Furthermore ay^t '■ 4>{.X[) —)■ 0(M) is a 

diheomorphism for all t. Therefore we can say 

D*,p(y) = d|,.„p(.#.,) = = 0 

where we have used the assumption and the fact that 000) = 0*. Note: the assumption of 
diheomorphism invariance could have been written as: 

P(00M)) = P(at(0(M))) = P(a,+,(0(M))) 
for f, r G and f = 0 in the last equality is the case used above. 

We conclude that VP^g ±^2 for all vector helds G F(T0o(M)). 

Fix 0(mo) G 0(M) and a vector (5(0(mo)) G T0(mo)0(d^)- Choose a sequence of smooth 
functions f^^ : 0(M) -)■ R such that /^fedvol = 1, supp/^^^ C B^^{(t){mo)) n 0(M) and 

Ck —)■ 0. (Here Pej.(0(mo)) is the ball of radius in R^ with center at 0(mo).) Dehne vector 
helds on 0(M) by 

■ <5(0("io))- 

Then we have 

0 = lim(VP*,K.,>= hm (VP*,/... QWmo))) 

Cfc-s-O cfc-^-O 

= lim f VP0(0(m)) ■ fe^Q{(l){mo)) = VP00(mo) ■ Q{(j){mo)). 

J0(M) 

Therefore VP^ T 10, pointwise. 

(^) Assume that VP^{m) T 0(Af) for all 0(m) G 0(M). This is equivalent to saying 
XPrp(m) T 10)(m) at each point 0(m) G 0(M) for all vector helds Y G F(T0(M)). This gives 

^|/=oP(0t) = 0, 4>t\t=o = 10 
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which means that moving in the direction of the flow ay^T generated by a fixed vector field 
Y is equivalent to moving along a level set in Because flows generated in this 

way are diffeomorphisms from to we can conclude that 

P(ay,i(0(M))) = P{m)) 


for all a, t, Y. 


□ 


3. An Estimate for Flows in Normal Gradient Directions 

The above result gives a condition for determining if the gradient vector field generated 
by a penalty function is normal at every point in 0(M). In the case where this is true, we 
would next like to consider how far 0(M) can move in a fixed normal gradient direction 
while remaining an embedding. The next set of results gives an explicit estimate for the 
lower bound of this flow. 

3.1 Notation and Definitions 

• e is the size of the neighborhood around 4>{M) in which each point has a unique clos¬ 
est point in (piM). The existence of this neighborhood for M closed is guaranteed by the 
e-Neighborhood Theorem [H Ch. 2, §3]. It is given explicitly in Lemma 3 in the proof of 
Theorem 3. 

• We will use two sets of coordinates on Standard coordinates will be denoted 

{x^, ... ,x^). We will also be representing points in (p{M) and in a small neighborhood 
around (p{M) as elements of the normal bundle N(j){M). In coordinates they will be given as 
{q ^,..., ..., where the first k components are manifold coordinates and the last 

N — k are coordinates for the normal space. These will be referred to as normal coordinates. 
For q G (p{M), its representation is {q^, ..., 0,..., 0). For w = (g^,..., g^, r^,..., 

inside a small neighborhood of 4>{M), q = (g^ ■ ■ ■ g^, 0 ■ ■ ■ 0) is re’s closest point in (p{M) and 
(0, • • • , 0, r^, • • • r^-^) = w — q e N^{M). 

• A vector in Nq(j){M) will be denoted as either tv{q) (where v is unit length) or as Ywi{q) 
where the {wi\ vectors are a unit length spanning set of the normal space at g. There are 
N — k {tCj} vectors, each with N coordinates. 

• For 0(M) C the map E : N(j){M) —)■ acts by E{q,r) = q + r (sending points to 

the end of perpendicular vectors in the normal bundle over (j){M)). It is given explicitly by: 

= (x^(g) -t-rX^(g),--- ,x^{q) + Ywf {q)) 

= + Ewl{q), • • • , 4>^{q) + Awf (g)), 

where the domain is in normal coordinates and the range is in standard coordinates. Points 
e = ge -|- Ue for which the Jacobian of the E map isn’t full rank (at the point (ge,Ue)) are 
defined as ’focal points.’ |9] 

• The inclusion map ^(M) —)■ takes points (g^, • • • , g^) •—t {x^{u), ■ ■ ■ ,x^{u)). It is a 

standard result that the first fundamental form is the matrix with entries {gij) = 

(Euclidean dot product) and the second fundamental form is the matrix with entries {v ■ Uj) 
where Uj is the normal component of the vector 

• In choosing coordinates that make the first fundamental form the identity matrix, the 
eiqenvalues Pi, • • • ,Pk of the second fundamental form are called the ‘principal curvatures’ 
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at g = (j){m) G 4>{M). Considering the normal line I = q + tv extending from q G 4>{M) [v is 
a fixed nnit normal vector at q) we have the proposition [9l p. 34]: 


Proposition 1. The focal points of ([0(M)],g) along I are precisely the points q + p^ ^v, 
where 1 < i < k,pi ^ 

• K = max P4)(m) where p^m) is the largest eigenvalne of (v^m) ■ hi) evalnated at 
q = (f){m) G 4>{M). 

• 5 is chosen snch that for d^N^x, y) < 6/2 {x, y G 0(M)) we know x + tv{x) ^ y + tv{y) 
for t < 6. It is dehned explicitly after the proof of Lemma 3. 

Note; The next two theorems are stated in terms of nnit length normal vector helds on 
4>{M). The Enler class of the normal bnndle is the obstrnction to the existence of snch a 
vector held. If this class is nonzero, we apply the theorem to vector helds where each vector 
has length at most one. 

Theorem 2. Let v be a normal vector field of length at most one along 4>{M) C and e 
be as defined above. = {0(m) + tn : m G M} is immersed in for t < e. 


Proof. We want to show that the map M 4>t{M) is an immersion for t dehned in the 
theorem statement, bnt becanse 0(M) is assnmed to be embedded in it snfhces to show 
that the map F : 4>{M) 4>t{M) (where for q G 4>{M), F{q) = q + tv{q)) is an immersion. 

We want to consider 0t(M) as sitting in an open snbset of R'^ that we can identify with the 
normal bnndle over 4>{M). In particniar, the e - Neighborhood Theorem [1] gives that on a 
compact, bonndaryless manifold in R^ -0(M) in onr case- there exists a snfhciently small 
e snch that for each point w in the set of points in R^ a distance less that e from the 
manifold- there is a nniqne closest point q in 4>{M). Furthermore w — q & Ng{(l){M)) where 
N(j){M) is the normal bundle over 4>{M). We can diheomorphically identify (locally) points 
in with elements in N(j){M) as follows: 

w {w — q)q 

where q is re’s unique closest point in 4>{M). When considering the case of our hxed vector 
held tv along 4>{M) as a section of the normal bundle we get the following coordinate 
representation of this section: 

0(m) -ftn(0(m)) ^ (g\ • • • ,q’",tv\q),-■ ■ ,tv^~\q)) 
where now the vector components are function of q. Therefore the map: 

F : C W 

has the normal coordinate representation: 
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1 ■■■ 0 \ 


0 ••• 1 


d{tv)^-*^{q) ___ a(to)"-fc(g) I 

dq^ / 

which has rank k, showing that the map taking 4>{M) 4>t{M) is an immersion for t < e. □ 

Next, we would like to show that 0* is injective, which along with its being an immersion 
(Theorem 2) and the assumption that M is compact is enough to conclude that (pt is an 
embedding. While Theorem 2 showed that 0* is an immersion for f < e. Theorem 3 will 
show injectivity for t < t*. Lemma 2 (included in the proof of Theorem 3) shows that t* < e. 
Therefore the hnal theorem showing 0^ is an embedding is on the interval t <t*. 

The statement of Theorem 3 uses the new value S which is dehned explicitly after the 
proof of Lemma 3. Recall that S is chosen such that for d^N{x,y) < S {x,y E (f>{M)) we 
know X + tv{x) ^ y + tv{y) for t < 5. 

Theorem 3. Let v be a normal vector field of length at most one along C Let 

t* = min{K~^,S/3}. Then fit '■ M ^ given by m fi{m) + tv{fi{m)) is an embedding 
for t <t*. 


the differential of which is given by: 


DF{q) = 


dq^jq) 

dq^ 


d{tv)'^ ^(q) 

a? 


dq^jq) 

dq'^ 


a{tvY-^{q) 




y 


/ 


Proof. It should be noted that we are interested in the injectivity of the map fit M ^ 
dehned above, but because fi{M) is embedded in it suffices to show that F : fi{M) -E- 
fit{M) is injective for t < t*. 

To view F as a map acting on open subsets of we dehne the function Fft from —)■ 

T^, the set of points a distance e — t and e from fi{M) in respectively. Setting tt : —)■ 
fi{M) with n{w) the closest point in fi{M) to w we can dehne: 

Hfiw) =w + 


Note that Ht\^(M) = F. 

We continue the proof with a series of Lemmas. 
Lemma 1. DHfiqo) is invertible for w = qo E fi{M) 


Proof. For Ht '. ^ ^ via w ^ w + tv{n{w)) its normal coordinate representation 

(explained in proof of Theorem 1) is given by: 


{q\ ... , • • • , r^-^) ^ {q\ • • • , + tn^(7r(g)), • • • , + tn^-^7r(g))) 


where it should be noted that the r*’s are independent of coordinates but the n*(g)’s are the 
coordinates for the hxed vector held along fi{M) which depend on q. For w = q^ E fi{M) 
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the differential of the Ht map (taken in coordinates) is given by: 

/ 


dqHm 


V 




9gi 


a(r"-'=+to"-'=(g))(g,0) 

1 


0 

djtv^ {q)){q,0) 
dq^ 


d{tv^ '^(q))(q,0) 


dq^(q,o) 

dq^ 


dqHm 


dq'‘(q,0) 

dq^ 

9(r^+tti^(g))(g,0) 

dq^ 


dr^ 


dq'^im 

dr^ 

d{r^+tv^ (g))(g,0) 
dr^ 


d(r^-'^+tv^-^{q)){q,0) (g))(g,0) 


dq^ 


1 0 

d{tvHq))iq,o) 1 

ag^ 


dr^ 


a(to"-fe(g))(g,0) 

dq^ 


0 


0\ 

0 
0 

ly 


dq^jqfi) 

0^n—k 


ag*^(g,0) 

fc 

a(r^+tDhg))(g,0) 

fc 


\ 


a(r" 




Qj>n — k 


Hq)){m 


This matrix is invertible for all t so we can conclude that there exists a ball of radius 
around qo, on which Ht is a diffeomorphism. □ 

Let dut = Luin . Although DHt is invertible for all time (the size of the neighborhood 

will change according to t), we must have t < e for Ht to be dehned. Therefore t is less than 
e and we can say: For x,y ^ 0(Af) with d]s_N{x, y) < Snt, we have x + tv{x) ^ y + tv{y) for 
t < e, and we can show injectivity: 

Lemma 2. Ht\^(M) is injective for t < t* = min{e, 


Proof. Assume instead that there exists some x,y ^ 0(Af) such that x + tv{x) 
and t < t*. We know by assumption that d]^N{x,y) > duf Therefore: 


dut < d9.M{x,y) 


= \x-y\ 

= \x — {x + tv{x)) + {x + tv{x)) — y\ 
= \x — {x + tv{x)) + {y + tv{y)) — y\ 

< \x - {x + tv{x))\ + \{y + tv{y)) - y\ 

= \fv{x)\ + \fv{y)\ = 2\t\ <2\t*\ 

< 2dHj3 


which is a contradiction. 


y + tviy) 


□ 


We now must compute e (the size of the neighborhood around 4>{M) within which each 
point has a unique closest point in (j){M)). Lemma 3 again uses d which is dehned explicitly 
following the proof. Recall: d is chosen such that for ds^N{x,y) < d {x,y E 0(M)) we know 
X + tv{x) ^ y + tv{y) for t < d. In the statement of Lemma 3, tv has been written in 
terms of unit length spanning vectors, wfs of the normal bundle N(j){M) with coefficients r® 
(1 < 7 < A^- A;). 



DARA GOLD 


Lemma 3. e = min{K ^,5/3} where 5 is such that for x,y E (f^M) and d^N{x,y) < 5 we 
have X + rl.Wi{x) ^ y + ryWi{y) where \rx\ < d and \ry\ < 5. 

Proof. Suppose there exists w E Y'^ such that there are two closest points x,y E 0(M). 

Then we can write w = x + rl.Wi{x) = y + PyWiiy) where \rx\ < e and \ry\ < e. We know by 

assumption that d-s^N{x,y) > S and we have a similar proof as in Lemma 2 : 

6<dKN{x,y) = \x-y\ 

= \x - {x + rl.Wi{x)) + {x + rl.Wi{x)) - y\ 

= \x - {x + rl,Wi{x)) + {y + PyWiiy)) - y\ 

< \x- {x + rl,Wi{x)) \ + \iy + PyWiiy)) - y\ 

= Kwi{x)\ + \rlwi{y)\ 

= \rx\ + \ry\ < 2e <25/?) 

which is a contradiction. □ 

We will obtain 5 in the following way: Recall E : N(j){M) —)■ acts on points in the 

normal bundle over 4>{M) by (g, r) t—)■ g + r. Here we will be considering the compact subset 
of N(j){M) which consists of vectors R such that |r| < .999iL“^. In coordinates, recall E is 
given by: 

= {x\q)+Pwl{q),--- ,x^{q)+Pwf{q)) 

= ( 0 ^ (g) + (g), • • • , 0 ^ (?) + (g)). 

Fix go = (go, • • ■ , go, 0, • • • ,0) E 0(M). For a point (go, vq) in the hber over go we know 

that DE{qo, ro) is invertible (see proof of Proposition 2) and therefore there is a ball of radius 

5(go,ro) around (qo,ro) on which E is a diffeomorphism. Because the hber over go is compact, 

we can let 5gg = min 5(go,ro) > 0 - 
ro 

Consider the set 

Ago = {? e 0 (M) : dffiiv (g, go) < 5gj2}. 

Then E is a diffeomorphism on the subset of N(j){M) given in normal coordinates by = 
{(g^ • • • ,g^,r^ • • • ,r’^“^)| |r| < 5qj2, (g^ • • • , g^, 0, • • • , 0) G as follows: For (gi,ri) G 
Bqo- 

l(?i,n) - (go, 0 )| = |(gi,ri) - (gi, 0 ) + (gi, 0 ) - (go, 0 )| 

< l(?i,c) - (?i, 0 )| + |(gi, 0 ) - (go, 0)1 
= ki| + l(?i, 0 ) - (go, 0 )| 

< 5gg/2 + 5gg/2 = 5gg. 

Therefore for (gi, ri), (g 2 , r 2 ) G Bg^ ((gi,ri) 7 ^ (g 2 ,r 2 )) we know (gi, 0), (g 2 , 0) G Ag^ and 
E{{qi,ri)) = gi + r\wi 7 ^ g2 + r^wi = E{{q2, r2)). 

We let 

(5 = inf 6gj2. 

go 

We can now say that for x,y E (f^M) and d^N^x, y) < 5 we have x + rl.Wi{x) ^ y + PyWiiy) 
for \rx\ <5 and |rj^| < 5 by construction. 
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It remains to compute ^(qo.ro) explicitly, from which we can get 6 with the method described 
above (Recall, ^( 5 o,ro) is fhe radius around {qo,ro) on which E is a diffeomorphism). We will 
compute 5(50,ro) using a quantitative version of the Implicit Function Theorem (adapted to 
the Inverse Function Theorem case), given as a proposition below. The formulation of the 
theorem, along with its proof is in the Appendix. 

For G G ,R^), let {qo,yo) G satisfy G{qo,yo) = 0. For fixed 7 > 0 let 

= {{q,y) e : \q - qo\ < 'J,\y - yo\ < 7 }- In the case where G{q,y) = E{q) - y, 
the following theorem is the adaptation of the Implicit Function Theorem to the Inverse 
Function Theorem (here the matrix norm ||A|| is the sup norm over the entries): 

Proposition 2. Assume that dgG{qo,yo) is invertible and choose 5° > 0 such that 
suP{q,y)(^v,^\^ - [dqG{qo,yo)]~^dqG{q,y)\ \ < 1/2. Let B^o = sup^q^y^^y^^\\dyG{q,y)\\ and M = 
\\dqGlqo,yo)~^\\. Let 5i = {2MBso)~^S^ and F^^ = {y e R"" : \\y - yo\\ < 5i}. Then in 
the case that G{q,y) = E{q) — y, the solutions to G{q,y) = 0(^ E{q) = y) in the set 
{(o’)!/) • Ik ~ Q'oll < 5°, Ik — i/olI < 5i} are given by {E~^{y),y). Alternatively, E is a 
diffeomorphism on E~^{Bs^{iif)) ABso{qo). 

We will apply the proposition to E : N(f){M) —)■ R^. Specihcally, in applying the propo¬ 
sition we have {{qo,ro),yo) as a base point (as opposed to simply writing {q,y) as in the 
proposition statement, we will write {{q,r),y) to emphasize use of normal coordinates), we 
have G{{q,r),y) = E{q,r) -y and G{{qo,ro),yo) = 0 (^ E{{qo,ro)) = yo). Therefore: 


d{q,r)G{{qo,ro),yo) = DE{qo,ro) 


( dA(qo,ro) I „idwl{qo,ro) 
/ agi "U A Qgi 


I d(f>^{qo,rQ) . i dwf(qo,ro) 

\ d<L ' EL 


Wn-kiQo) 


which is invertible for |r| < K~^ as required by the proposition’s assumption. Again, our goal 
is to get a d^qqyq) neighborhood around {qo,ro) on which E is a diffeomorphism. Following 
the proposition’s steps we have: 

Step 1: 


Bgo 


sup((5,^),y)eVo \\dyG{{q,r),y)\\ 

(90 ."^o) 


^^P((<7T),2/)eWo \\ 9 y{E{q,r) -y)\\ 

(10 •'^0) 


^^P((q,r),y)eV^0 

(90 ."^o) 


TOO 


-1 


1 , 


where we have taken the maximum of the absolute values of the matrix’s entries for the 
matrix norm. 


Step 2: 


^ = iid(q,r)G((qo,ro),yo) ^|| = ||T>^(go,ro) ^ 
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Using Cramer’s rule and the matrix adjugate to invert DE{qQ,ro), we have 

where DE^Qq, ro)*j is the (j, 2 :)th minor of DE{qQ, Tq), or the determinant of the (n — 1) x 
(n — 1) matrix constructed by deleting the jth row and zth column of DE{qo,ro), which 
gives an explicit way to compute M above. 

Step 3: 


We want to compute such that sup((g||1 - [DE{qo,ro)] ^DE{q,r)\\ < 

(90 .'^O) 

1 /2. Since this expression doesn’t rely on y, we need 5°^^ such that for \{q, r)| < ^ 

||1 — [DE{qQ,rQ)]~^DE{q,r)\\ < 1/2. To do this we can consider a hrst order Taylor series 
expansion on DE{q,r) around {qo,ro)- (Note: the j index in the second matrix below refers 
to coordinates in not an exponent.) We have: 

DE{q, r) 

/ d 4 ,^{ qo , ro ) I idwl { qo , ro ) j / n \ 


d(t>^(qo,ro) , i dwff{qo,ro) 
dq^ ' dq^ 

/ ^ 








Y.Rr 


d4’'^{qo) I i dwl{qq) 
dq^ U Qgl 


d<t>^{qo) I dw^jqo) 


K-AQo) 


wu; I r 
^1 ^ ' I 


0 9(jrl 


Wn-A^o) 


iV 

E Rf'^\q,r){z - ZoY 


N 

E Rf'^\(l,r){z - ZoY 

i=i 


E Rj YQ,r)(z- ZoY 


Y. RT'^YRrY^ - ZoY 


where z — Zo = A ~ Qo-, " ' lA ~ ~ ^ ~ have a uniform bound 

on the error term given by: 

\Rf"^\q,r) \ < max{ ^ < _gggK-\qe 0 (M)} '^Eg'W) 


For (/, m) with 1 < / < iV and 1 < m < /c, 
and A; + 1 < m < iV, //" = wY{q)- 


m _ d<p^{q) I ^j dwY^jq) 

1 r I oT7 • 


For (/, m) with 1 < / < iV 
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Plugging the above sum for DE{q,r) in the expression ||1 — [DE{qQ,ro)] ^DE{q,r)\\ we 
see that the hrst term cancels with the identity matrix and we are left with; 


N 


i=i 


[DE{qo,ro)] 


-1 


’ {(l,r){z-Zoy 


N \ 

^ Rf’^\q,r){z - ZoY ' 
i=i 


N 


Rf’^\q,r){z- ZoY 


N 


j:Rf'’"'\q,r){z-ZoY 


( 


VA 

N 


i=i 


/ 


N 


{[DE{qo,ro)\ i)(i,p) £ Rf^\q,r){z - ZoY ■ ■ ■ {[DE{qo,ro)] i)(i,p) £ Rf^\q,r){z - ZoY 


i=i 


N 


{[DE{qo,ro)] Y{n,p)Y. Rt^\qR){z - ZoY ••• {[DE{qo,ro)] Y{n,p)Y. RY’'" - ZoY 


i=i 


N 




V 


i=i 


i=i 


< 


N 


{[DE{qo,ro)\ ^)(i,p)(5(’g,,^o) E ••• {[DE{qo,ro)\ 

N N 

([DE(go,ro)]-^)(v,p)5(’go,^o) E Rf^\q,r) ■ ■ ■ ([DE(go, ro)]-i)(iv,p)5(’go,^o) E R^f’^\q,r) 

i=i i=i 

i ; (1) 

Letting = 2 n..x(|DE(M,„)l--)l.,|«GlP.-l ™ ‘I™* (l) 

(l,m) ’ 

exceed 1/2, as each entry has absolute value less than 1/2 by construction. 

Step 4: 

Now that we have a value for <5°^^ we can compute as in the proposition statement 


N 




V 




< 


by: 

where the last equality is from Step 1 and M is computed in Step 2. 

Step 5: 

By the theorem statement we know E is a diffeomorphism on 

P(,o,ro) = E-\Bsi^^^^Jyo))nBso^^^^Jqo,ro). 

In particular we need a ball of radius (5(qo,ro) around (go, lq) on which i? is a diffeomorphism. 
First, we need a such that for |(g,r) - (go,ro)| < Sfq^^ro) implies |E(g, r) - E(go, ro)| = 

\E{q, r) — yo\ < We can again compute this using a Taylor series expansion of 

E aronnd (gojTo). We have 

E{q,r) = E{qo,ro) + ( ^ i?j(g, r)((g, r) - (go,ro))^-- - , ^/?f (g, r)((g, r) - (go,ro))^) 
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where we have bounds on the error terms given by: 

+ r‘^Wi){q, r 


?")| < max 


Then we have 


dz^ 


: 1 < j < iV, g G (j){M),r < .999K 


_ 1 I def 


= GP. 


\E{q,r) - E(go,ro)|^ 

= - iQo,ro)y,--- ,'^Rf{q,r){{q,r) - {qo,ro)y)y 

3 3 

N N 

= - ( 9 o,ro))^T = I ^i?^(g,r)((g,r) - (go,ro))^f 

p=i j p=i j 

N N 

- “ (?o,r’o))^P < |G^((g,r) - (go,ro))^r 

p=i j p=i j 

N N N 

s E Z E E IG-P = E IG'L 

p=l j P=1 j P=1 


Therefore 


\E{q,r) - ^(go,ro)| 


N 


< 6 


3 

( 90 ,fo) 




/VE|G»P. 

p=l 


\Gpyj gives the required radius. We hnally set 

<5(,o,.o) = min{53^^ 

Finally, returning to the statement in Lemma 2, we had: is injective for t < 

t* = min{e, = min{iF“^, 5/3, ^}. By dehnition we know that for x,y ^ 0(^) and 
d]s^N{x,y) < 5 we have x + rywi{x) y y + ryWi{y) (where \ry < 5 and \ry < 5). However, 
we also have that for x,y ^ 0(^) satisfying d^iv (x, 1/) < Shi, x + tv{x) y y + tv{y) for 
t < e < 6. Therefore we can say that 6 < dut- This is because our specihc vector field tv 
gives a particular set of r*’s at each point, allowing for a larger diffeormorphic neighborhood 
around the base point than a neighborhood that works for all set of r*’s. Therefore we have 
is injective for t < t* = min{e, = min{iF“^, |, = min{iF“^, |} as required. 

We have shown that (pt is an injective immersion for t <t* (by the fact that t* < e). Since 
M is compact is an embedding. 

This concludes the proof of Theorem 3. □ 


and letting (\/^ 


Appendix: The Quantitative Implicit Function 

This quantitative version of the Implicit Function theorem and its variation of standard 
proof techniques is due to Calangelo Liverani [7]. 

Fix n, m e N and E G M™) and let (xo,Ao) G x M” satisfy F(a;o,Ao) = 0. 

For 5 > 0 let W = {(x. A) G : ||x — XqH < 5, ||A — Ao|| < 5}. 

Theorem 4. (Quantitative Implicit Function Theorem) Assume that dxE(xo, ^o) in¬ 
vertible and choose 5 > 0 such that sup(^^ x)eVs 111- [a,F(a;o,Ao)]-'a,F(a;,A)|| < 1/2. Let 
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Bs = sup^^^^)(,Vs\\9xF{x,X)\\ and M = | |(9a;F(a;o, Ao)"^| |. Set 6i = {2MBs)~^5 andVs^ = 
{A G M” : ||A — Aoll < Then there exists g G such that all the solutions 

of the equation F{x,\) = 0 in the set {(t, A) : ||A — Ao|| < 5i, ||a; — xqH < 5} are given by 
{g{\),\). In addition, dxg{X) = -{d,,F{g{X), X))-^dxF{g{X), X) 

Proof: Set A{x, A) = dxF{x, A), M = | |A(a;o, Ao)“^| |. 

We want to solve the equation F{x,X) = 0. Let A be such that ||A — Ao|| < < S. 

Consider Us = {x E : ||a; — XqH <5} and the function Q : Us ^ dehned by 

nA(T) = X- A{xo, Xo)~^F{x, A). 

For X E U{X), F{x, A) = 0 is equivalent to a; = hlA(T). 

Next, 

\\Qx{xo)-^Xo{xo)\\<M\\F{xo,X)\\<MBs6i 
In addition, ||(9a;f2A|| = ||1 — Ao)“^A(a;, A)|| <1/2. Thus 

1 1 

||nA(T) - Toll < -||t - Toll + ||flA(To) “ To|| < -||t - To|| + MBsSi < S 

The existence of x E Us such that flA(T) = x follows by the Fixed Point Theorem. We have 
therefore obtained a function g : Ts^ = {X : ||A — Ao|| < 5i} -E M’” such that F{g{X), A) = 0. 
It remains to prove regularity. Let A, A' G F^^. From above we have 

119(A) - 9(A')II < 1||9(A) - 9(A')II + MB,\\- \'\ 

This yields the Lipschitz continuity of the function g. To obtain the differentiability we note 
that, by the differentiability of F and the above Lipschitz continuity of g, for h E small 
enough, 

||F(^(A + h),X + h)- F{g{X), A) + dxF[g{X + h) - g{X), h] + dxF{g{h), h)\\ = o(||h||) 
Since F{g{X + h),X + h) = F{g{X), A) = 0 we have 

hm| |h| l"^! |c/(A + h)- g{X) + [dxF{g{h), h)]-^dxF{g{h), h)11 = 0, 

Al —>-0 

which concludes the proof. 
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